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PARALLEL PROCESSOR LANGUAGE, METHOD FOR TRANSLATING C++ 
PROGRAMS INTO THIS LANGUAGE, AND METHOD FOR OPTIMIZING 
EXECUTION TIME OF PARALLEL PROCESSOR PROGRAMS 

CROSS-REFERENCE TO RELATED DOCUMENTS 
[0001] The present application herein incorporates United States Patent Application 
Serial No. 10/621,737, filed July 17, 2003, by reference in its entirety. 

FIELD OF THE INVENTION 
[0002] The present invention generally relates to the field of integrated circuits, 
particularly to a parallel processor language, and more specifically to a method for 
translating C++ programs into a parallel processor language and a method for optimizing 
execution time of a parallel processor program. 

BACKGROUND OF THE INVENTION 
[0003] Integrated circuits (ICs) are developed in order to process information. Every IC 
has at least one input and at least one output. An IC receives input data, processes the 
data and then produces the result. In many cases, ICs are defined by means of high-level 
languages such as the C++ programming language. One type of ICs, called a processor, 
is capable of executing programs that are written using a special language consisting of 
small simple commands instead of the C++ programming language. A language that 
consists of simple commands is defined as a processor language, and a program written 
in a processor language is defined as a processor program. 

[0004] For example, the following C++ command: 

Z = A+B+C 



2 



03-0929 



may be presented as a sequence of simple commands that can be processed by a 
processor IC: 

1. Take A. 

2. Take B. 

3. Evaluate A+B. 

4. Take C. 

5. Evaluate (A+B)+C. 

6. Move the result to Z. 

[0005] It is easy to see that the result of such a presence of any C++ command is the 
sequence of several simple commands. In the above example, one C++ command is 
translated into 6 simple commands. Consequently, if a processor executes one simple 
command per one clock cycle, it may take 6 clock cycles to execute the considered C++ 
command. Thus, one of the most important problems is to minimize the time for a 
processor to execute a C++ program and to minimize the size of a processor program. 

[0006] Thus, it would be desirable to provide a processor language that allows some 
simple commands be carried out in parallel. This parallel processor language may consist 
of very small number of different commands so that the processor IC that executes these 
commands is easy to create, and the processor IC may have a small area and may be able 
to execute commands with high speed or high frequency. Moreover, it would also be 
desirable to provide a method for translating a C++ program into a parallel processor 
program, which method may minimize the size of the obtained processor program. 

[0007] A processor program loaded to a processor may be written in two different ways: 
(1) the program may be first written in some high-level programming language like C++, 
Pascal, etc., and the program may be then translated into the processor language; and (2) 
the program may be written in a processor language manually. In either case, it would be 
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desirable to provide a method (e.g., an algorithm, and the like) that allows the execution 
time of a processor program to be optimized (e.g., minimized). That is, it would be 
desirable to provide a method for optimizing execution time of a processor program 
written in a parallel processor language, which method may receive a processor program, 
optimize this program and output a resulted program that is functionally equivalent to the 
original one but has reduced execution time. 

SUMMARY OF THE INVENTION 
[0008] Accordingly, the present invention is directed to a parallel processor language, a 
method for translating C++ programs into a parallel processor language, and a method for 
optimizing execution time of a parallel processor program. In a first exemplary aspect of 
the present invention, a parallel processor program for defining a processor integrated 
circuit includes a plurality of processor commands with addresses. The plurality of 
processor commands may include a starting processor command, and each of the 
plurality of processor commands includes one or more subcommands. When the 
processor integrated circuit executes the parallel processor program, the processor 
integrated circuit executes the staring processor command first and then executes the rest 
of the plurality of processor commands based on an order of the addresses. Moreover, 
when the processor integrated circuit executes a parallel processor command, the 
processor integrated circuit executes all subcommands included in the parallel processor 
command in parallel in one clock cycle. 

[0009] In a further exemplary aspect of the present invention, a method for translating a 
C++ program into a parallel processor program with a minimum size defining a processor 
integrated circuit may include the following steps: (a) dividing the C++ program into a 
plurality of C++ functions allowed to call each other; (b) translating the plurality of C++ 
functions into a plurality of blocks TRANS(<C++ function>) written in a parallel 
processor language; and (3) concatenating the plurality of blocks TRANS(<C++ 
function>) into the parallel processor program. 
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[0010] In another exemplary aspect of the present invention, a method for optimizing 
execution time of a parallel processor program may include the following steps: (a) 
receiving a parallel processor program; (b) making dummy jumps optimization; (c) 
making linear code optimization; (d) making jumps optimization; (e) returning back to 
said step (b) when there is any change made in said steps (b), (c), and (d); and (f) 
outputting a resulted new processor program when there is no change made in said steps 
(b), (c),and(d). 

[0011| It is to be understood that both the foregoing general description and the 
following detailed description are exemplary and explanatory only and are not restrictive 
of the invention as claimed. The accompanying drawings, which are incorporated in and 
constitute a part of the specification, illustrate an embodiment of the invention and 
together with the general description, serve to explain the principles of the invention. 

BRIEF DESCRIPTION OF THE DRAWINGS 
[0012) The numerous advantages of the present invention may be better understood by 
those skilled in the art by reference to the accompanying figures in which: 
FIG. 1 shows an exemplary C++ program "ORDER"; 

FIG. 2 shows a result after the C++ program "ORDER" shown in FIG. 1 is 
translated into a parallel processor program in accordance with an exemplary 
embodiment of the present invention; 

FIG. 3 is a flow diagram showing an exemplary method for optimizing execution 
time of a parallel processor program in accordance with an exemplary embodiment of the 
present invention; 

FIG. 4 is a flow diagram showing an exemplary embodiment of the step of 
making dummy jumps optimization shown in FIG. 3 in accordance with an exemplary 
embodiment of the present invention; 
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FIG. 5 is a flow diagram showing an exemplary embodiment of the step of 
making linear code optimization shown in FIG. 3 in accordance with an exemplary 
embodiment of the present invention; and 

FIG. 6 shows a result after the program shown in FIG. 2 is optimized in 
accordance with an exemplary embodiment of the present invention. 

DETAILED DESCRIPTION OF THE INVENTION 
[00131 Reference will now be made in detail to the presently preferred embodiments of 
the invention, examples of which are illustrated in the accompanying drawings. 

A. Considered C++ language 

[0014] As an example, C++ programs that include the following C++ constructions only 
are considered. However, it is contemplated that other C++ constructions may also be 
used without departing from the scope and intent of the present invention. 

(1) Unsigned integer variables 
[0015] An example is: unsigned i; 

(2) Unsigned one-dimensional arrays 
[0016] An example is: unsigned x[ 10]; 

(3) One or more C++ functions 

[0017] One of these functions must be the function void main (), which is the main 
function of the C++ program. All the functions may be of one of two types: unsigned 
and void. Each of the types is allowed to have some unsigned integer arguments and 
some unsigned integer local variables. 

[0018] The following is an example: 
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unsigned need _exchange (unsigned posl, unsigned pos2) { 
unsigned val; 



} 

In this example, the posl and pos2 are arguments of the function need_exchange, and the 
variable val is the local variable of this function. 

(4) The body of each function may include the following C++ operators 
only 

a. return-operator 
[0019] There may be two situations: 

- return <expression>; inside any function of the type unsigned; 

- return; inside any function of the type void. 

The term <expression> represents any C++ expression that includes C++ variables, 
unsigned integer constants, arithmetic-logical operations (+, -,*,/, %, <, <= >, >= == 
K U ~ &, |, A , &&, ||, «, », etc.) and calls of unsigned integer functions. An example 
may be: return need exchange (i, i+1); 

b. set-operator 

[0020] The representation may be: 

<variable> = <expression> , 

where the term <variable> is a name of an unsigned integer variable, or a name of an 
unsigned integer array with index expression: 
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< unsigned Jnteger jirray>[< index _expression> ]. 

[0021] The following are some examples: 

i = i+7; 
x[l] = i; 
x[2] = 0; 

xfi+lj = xfij + need_exchange(U i+1); 

c. void function call-operator 
[0022] The representation may be: 

<void Junction _name> (<listj)f_arguments> ); 

d. if-operator 

[0023] The representation may be: 

if (<expressiori> ) <positive_pperator> 
else <negative operator> 

where the <positive_operator> and <negative_operator> are any allowed C++ operators. 
Note that the branch else < negative operator> is optional. 

[0024] The following are some examples: 

if(x[0]==5)x[0]=0; 

or 

if(x[0]==5)xf0]=0; 
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else x[0]=x[0]+l; 

e. while-operator 

[0025] The representation may be: 

while (<expression>) <body> 

where <body> is any allowed C++ operator. An example is: while (xfiJ-^O) i=i+7; 

f. for-operator 
[0026] The representation may be: 

for (<at_start>;<expression>;<atJteration Jinish>) <body> 

where <at_start> and <at_iteration_finish> are the ^-operators or empty, and <body> is 
any allowed C++ operator. Since the wftz/e-operator may be considered as a /or-operator 
with empty operators <at_start> and <at_iteration_finish>, the wMe-operator will not be 
considered any further in the present application. 

g. begin-end-operator 
[0027] The representation may be: 

{ 



< 



operator 1 > 



<operator_N> 

} 

where <operator_l>,..., <operator_N> are any allowed C++ operator. 



9 



03-0929 



h. break-operator 
[00281 The representation may be: 

break; 

This operator may be placed inside the while and for operators. 

[0029] Now all possible C++ constructions allowed to be used in a C++ program 
considered in the present application have been provided. The information on the syntax 
and the meanings of these constructions may be found in any book that is devoted to the 
C++ programming language. 

[0030] FIG. 1 shows an exemplary C++ program called "ORDER". Those of ordinary 
skill in the art will understand that the exemplary C++ program "ORDER" shown in FIG. 
1 orders the unsigned integer array x in the ascending order if dir!=0 and orders the array 
x in the descending order if dir==0. The ordered array is stored in the unsigned integer 
array y. 

B. Processor language 

[0031] A processor program is a list of processor commands. Each processor command 
may include no more than N small commands called subcommands. In each clock cycle 
a processor IC executes one processor command. Thus, the processor IC may execute N 
subcommands in parallel in a clock cycle. As a result, a processor language and a 
processor program in accordance with the present invention are also called a parallel 
processor language and a parallel processor program, respectively. 

[0032| When N is large, a complicated processor IC has to be developed to carry out N 
subcommands simultaneously. On the other hand, the time of the processor program 
execution may become much shorter than is the case of small N. When N is small, a 
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simple processor may be required but the processor program is executed with low speed. 
In practice, N = 6 may be taken. 

[0033] A processor may carry out a processor program as follows. All commands of the 
processor program are enumerated by numbers 0, 1, 2, .... These numbers are called 
addresses of commands. The processor program may have a starting address that is the 
address of the command that should be executed first (starting processor command) when 
the processor begins the program execution. In each clock cycle, the processor carries 
out one processor command and carries out all its subcommands one-by-one. All the 
subcommands may be divided into 2 groups: jump-subcommands and non-jump- 
subcommands (all other subcommands). Jump-subcommands show the processor the 
next command to be executed. When the processor meets a jump-subcommand, the 
processor may stop execution of the current command and go (or jump) to the other 
command in the next clock cycle. When the processor does not meet any jump-command 
during execution of a command with address ADDRESS, then in the next clock cycle the 
processor executes the command with address ADDRESS+1. 

[0034] The processor program is allowed to use some predefined unsigned integer 
variables and some one-dimensional unsigned integer arrays of the constant length. 

[0035] All possible types of subcommands in a parallel processor program in accordance 
with the present invention may be described as follows. 

(1) Subcommand for finishing the processor program execution - FIN 

[0036] When a processor meets the command FIN, the processor stops the execution of 
the program. 

(2) Subcommands for reading and writing unsigned integer variables and 
arrays 
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[0037] Different variables and arrays may be accessed (read and written) independently 
from each other. Any variable may be read any time. To write a new value to a unsigned 
integer variable, one of the following two commands may be used: 

SET <variable_name_l> <variable_name_2> 

SETCONST <variable_name_l> <unsigned_integer_constant> 

[00381 The subcommands SET and SET CONST assign the value of the variable 
<variable_name_2> and the constant <unsigned_integer_constant> (respectively) to the 
variable <variable_name_l>. Both of these two subcommands take 1 clock cycle to 
perform the assignment, so the value of the variable <variable_name_l> cannot be 
written one more time inside the same processor command. 

[0039] The unsigned integer arrays need 1 clock cycle for either reading or writing. That 
means that one cannot both read and write, or read 2 times, or write 2 times inside the 
same processor command. The following are the subcommands for reading arrays: 

READ <array_name> <variable_name> 

READCONST <array_name> <unsigned_integer_constant> 

[0040] The subcommands READ and READ CONST read the value of the elements of 
array <array_name> with indexes equal to the value of variable <variable_name> and 
equal to the constant <unsigned_integer_constant>, respectively. Since array reading 
takes 1 clock cycle, the value read may be used in the next processor command and later 
only. This value may be used starting from the next processor command (or next clock 
cycle) while other reading or writing operation with the array <array_name> is not 
performed. 

[0041] For example, consider a small C++ program: 
y=x[ij;z = 0; 
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[0042| The following processor program may perform the same action as the above C++ 
program: 

0: READ x i; 

1: SETjOONSTz 0; 

2:SETyx; 

3: FIN; 

where the array x is read in the command with the address 0, so the value x may be used 
only in commands 1,2,.... 

[0043] The following is a list of subcommands for writing arrays: 

WRITEVARVAR <array_name> <variable_name> <index_variable_name> 
WRITEVARCONST <array_name> <variable_name> <index_constant> 
WRITECONSTVAR <array_name> <constant> <index_variable_name> 
WRITE CONST CONST <array_name> <constant> <index_constant> 

[0044] The subcommands WRITE VAR VAR and WRITE_VAR_CONST write the 
value of the variable <variable_name> to the element of the array <array_name> with 
index equal to the value of the variable <index_variable_name> and equal to the 
unsigned integer constant <index_constant>, respectively. The subcommands 
WRITE CONST VAR and WRITE CONST CONST write the constant <constant> to 
the element of the array <array_name> with index equal to the value of the variable 
<index_variable_name> and equal to the unsigned integer constant <index_constant>, 
respectively. These 4 subcommands take 1 clock cycle to perform a writing operation. 

(3) Arithmetic-logical operations evaluation 

[0045] There is a special arithmetic-logical device (called ALD) that serves to evaluate 
arithmetic and logical operations that are included in the C++ expressions. All the 
arithmetic-logical operations (+, - 5 *, /, %, <, <= >, >= =, != !, ~ &, |, * ? &&, ||, « ? 
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», etc.) are enumerated. Each of these operations has its own unsigned integer 
identifier. The maximal number of arguments of these operation is 2. 

[0046] There are 4 special variables: ALDO, ALDl, ALDOP, ALDZ with the 
following meanings. The variables ALDO and ALDl store values of operation 
arguments, and the variable ALDOP stores the identifier of the operation. The variables 
ALD O, ALDl and ALD OP are write-only variables. The variable ALD_Z is a read- 
only variable that stores the result of the operation ALDOP applied to the values of 
variables ALD O and ALDl. In order to evaluate the operation one may write the 
corresponding values to the variables ALD O, ALDl and ALD OP (using 
subcommands SET and SETCONST described above) and read the variable ALD__Z. 
Note that because writing values of the variables ALD O, ALD l and ALDOP takes 1 
clock cycle, the value of the variable ALD Z is allowed to be read one clock cycle later 
after the values of the variables ALD O, ALD l and ALD OP are written. 

[0047] For example, consider a C++ program: 

y = /+/ 

[0048] This program may be translated into a processor program as follows: 

0: SET ALDO i; SET AID J I; SET ALDjDP identifier >(+); 
1: SETy ALD Z; 
2: FIN; 

where the command with address 0 consists of 3 subcommands that write values of 
variables ALD O, ALD l and ALD_OP in parallel. 

(4) Storing temporary results of evaluations 

[0049| During the evaluation of complicated C++ expressions, it may be necessary to 
keep temporary results. There is a stack EVAL STACK that is used for this purpose. 
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This stack is based on the principle LIFO (last in first out). There are several 
subcommands for putting, reading and removing values from the stack EVALSTACK. 

PUT <variable_name> 

PUTCONST <unsigned_integer_constant> 

DROP <unsigned_integer_constant> 

[00501 The subcommands PUT and PUTCONST put the value of the variable 
<variable_name> and the constant <unsigned_integer_constant> (respectively) to the 
stack EVALSTACK. The subcommand DROP removes the 

<unsigned_integer_constant> values from the stack EVAL STACK. These 3 commands 
each take 1 clock cycle to perform the actions with the stack. That is why one is not 
allowed to use 2 or more subcommands PUT, PUT CONST and DROP inside the same 
processor command. The values that are stored in the stack EVAL STACK may be read 
using variables STCK_0, STCKl, etc. The variable STCK 0 stores the value that was 
put in the stack last. The variable STCK l stores the value that was put in the stack 
second to the last, etc. In the following example how to use subcommands to control the 
stack EVAL STACK is shown. 

[0051] Consider a C++ program: 

i=j;y = 0; 
[0052] This program may be translated to: 

0: PUTj; 

1 : PUTJCONST 0; 

2: SETi STCK l; SETy STCK0; DROP 2; 
3: FIN; 

[0053] Note that the subcommands PUT, PUT CONST and DROP are not placed inside 
the same processor command. 
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(5) Calling functions 

[0054] There are 2 subcommands for calling a function and returning back from the 
function: 

CALL <address> 
RETURN 

[0055] These two subcommands are jump-subcommands. The subcommand CALL 
causes the processor to go to the command with the address <address>. The 
subcommand RETURN returns the processor to the command that follows the command 
containing the last performed subcommand CALL. 

(6) All other j u mp-su bcomm ands 

[0056] The following is a list of all other jump-subcommands: 
JUMP <address> 

ZEROJUMP <variable_name> <address> 
NONZEROJUMP <variable_name> <address> 
LOOPINCNOLESS <variable_name> <constant_value> <address> 
LOOP_INC_NOMORE <variable_name> <constant_value> <address> 
LOOPDECNOLESS <variable_name> <constant_value> <address> 
LOOPDECNOMORE <variable_name> <constant_value> <address> 

[0057] The subcommand JUMP indicates that the processor jumps to the command with 
address <address>. Note that the term "jump to the command with the address 
<address>" means that the processor executes the command with address <address> in 
the next clock cycle instead of the command that follows the current command. 

[0058] The subcommands ZERO JUMP and NONZERO_JUMP read the value of the 
variable <variable_name>. If the value read is equal to 0 (ZERO JUMP) or this value is 
not equal to 0 (NONZEROJUMP), then the processor stops execution of the current 
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command and jumps to the command with address <address>. Otherwise, the processor 
continues execution of the current command. 

[0059] More complicated actions are performed when the processor meets the commands 
LOOP_INC_NOLESS, LOOP_INC_NOMORE, LOOP_DEC_NOLESS, 

LOOP DEC NOMORE. In this case the processor may perform the following actions: 

a) The processor checks if the value of the variable <variable_name> is 
not less than the constant <constant_value> for subcommands 
LOOPINCNOLESS and LOOPDECNOLESS, and the processor checks if 
the value of the variable <variable_name> is not more than the constant 
<constant_value> for subcommands LOOPINCNOMORE and 
LOOP_DEC_NOMORE; 

b) The processor increases the value of the variable <variable_name> by 1 
for subcommands LOOP_INC_NOLESS and LOOP_INC_NOMORE, and 
decreases the value of the variable <variable_name> by 1 for subcommands 
LOOP_DEC_NOLESS and LOOP_DEC_NOMORE; 

c) If the result of checking in a) above is true, then the processor stops 
execution of the current command and jumps to the command with address 
<address>. Otherwise, the processor continues execution of the current 
command. 

[0060] Note that each of 4 subcommands LOOP* above deals with writing a new value 
to the variable <variable_name>. Thus, like a subcommand SET, each takes 1 clock 
cycle. 

C. Translation of C++ programs into processor language 

[0061 1 According to an exemplary embodiment of the present invention, the method for 
translating a C++ programs into the processor language (described in Part B above) may 
minimize the size of obtained processor program. The size of a processor program is the 
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total number of subcommands that are included in all commands of the processor 
program. 

(1) Definitions 

a. F::VAR 

[0062] If a variable VAR is a local variable of the C++ function F or the variable VAR is 
an argument of the C++ function F, one may denote this variable F::VAR. 

b. Block 

[0063] Block is a sequence of processor commands. The present method constructs a 
processor program from some blocks. 

c. TRANS(<C++ program>) and TRANS(<C++ function>) 

[0064] A C++ program can be considered as a list of C++ functions that are allowed to 
call each other. Each of these functions may be translated into a processor language 
separately. 

[0065] Denote TRANS(<C++ program>) as a processor program that is the result of 
translation of the C++ program <C++ program> into the processor language. Denote 
TRANS(<C++ function>) as a block obtained as the result of translation of C++ function 
into the processor language. The processor program TRANS(<C++ program>) may be 
presented as the concatenation of blocks TRANS(<C++ function>) created for each 
function included in the C++ program. 

[0066] The C++ program ORDER shown in FIG. 1 may be shown as: 

0: TRANS(exch) 

1: TRANS(need_exch) 

2: TRANS(main) 
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[0067] The starting address of the obtained processor program is the starting address of 
the block TRANS(main) created for the main function of the C++ program. 

(2) Translation of C++ function 

[0068] Now how to create the block TRANS(<C++ function>) is described. Any C++ 
function has a body starting with the symbol "{" and finishing by the symbol "}". This 
body may be considered as a begin-end-command. This command may be denoted as 
<C++ function body>. 

[0069| For example, the body of the function "exch" of the program "ORDER" shown in 
FIG. 1 is the following begin-end-operator: 

f 

imp = yfposlj; y[posl]=y[pos2]; y[pos2]=tmp; 

} 



[0070] According to an exemplary embodiment of the present invention, the block 
TRANS(<C++ function>) is presented as: 

0: TRANS(<C++ function body>) 
1: <function Jinish_command> 

where the TRANS(<C++ function body>) is the block obtained as a result of translating 
the C++ command <C++ function body> into the processor language, and the 
<function Jinish_command> is the following processor command: 

a) if the <C++ function> is the main function, then the command 
<function Jinish_command> is the command: 
FIN 
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b) if the <C++ function> is any void function except the main function, 
then the command <function Jinish_command> is the command: 

RETURN 

c) if the <C++ function> is any unsigned integer function, then the 
command <function _finish_command> is empty. 

[0071] Now it is needed to explain how to create the block TRANS(<C++ command>) 
for any allowed C++ command. But before that, translation of expressions is described. 

(3) Translation of C++ expressions 

[0072] In the present invention, one may consider any C++ expressions that contain C++ 
variables, unsigned integer constants, arithmetic-logical operations (+, -,*,/, %, <, <=, >, 
>= == !=, !, ~ &, | ? A , &&, ||, «, », etc.), and calls of unsigned integer functions. 

[0073] A recursive definition of a C++ expression is now given. The expression may be 
one of: 

A) <unsigned_integer_constant>; 

B) <unsigned_integer_variable_name>; 

C) <unsigned_integer_array_name>[<index>]; 

D) <unary_operation> <argumentl>; 

E) <argumentl> <binary_operation> <argument2>; 

F) <C++ function> (<expressionl>, <expression2>. .. <expressionM>), 

where <index>, <argumentl>, <argument2>, <expressionl>, <expression2>... 
<expressionM> are some other expressions, <unary_operation> is one of the operations: 
~, !, and <binary_operation> is one of the operations: +, -, *, /, %, <, <=, >, >=, =, !=, 
&, |, A , &&, ||, «, », etc. 

[0074] Translation of the expression into the processor language in each of the cases A, 
B, C, D, E, F indicated above is now described. One may recursively define the block 
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TRANS(<expression>) as a result of translation of the <expression>, and the variable 
EXPR_VAR(<expression>) for storing the result of the evaluation of the expression after 
performing the block of commands TRANS(<expression>). 

a. Case A 

[0075] The <expression> is the constant. The TRANS(<expression>) may be defined as 
empty, and the EXPR_VAR(<expression>) may be undefined. 

b. Case B 

[0076] The <expression> is the variable. The TRANS(<expression>) may be defined as 
empty, and the EXPR__VAR(<expression>) may be defined as the variable 
<unsigned_integer_variable_name>. 

c. Case C 

[0077] The <expression> is the array element. The following may be defined: 
EXPR_VAR(<expression>)=<unsigned_integer_array_name>. 

[0078] The TRANS(<expression>) may be obtained as the following sequence: 

0: TRANS(<index>) 

1: <read_array_command> 

where the block TRANS(<index>) evaluates the expression <index>, and the command 
<read_array_command> may be created as follows: 

a) if the expression <index> is a constant, then the command 
<read_array_command> includes 1 subcommand: 

READJCONST <unsignedjnteger_array_name> <index> 
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b) if the expression <index> is not a constant and 
EXPR_VAR(<index>)==STCK_0, then the command <read_array_command> 
includes 2 subcommands: 

READ <unsignedjnteger_array_name> STCKO; 
DROP 1; 

c) if the expression <index> is not a constant and 
EXPR_VAR(<index>)!=STCKJ), then the command <read_array_command> 
includes 1 subcommand: 

READ <unsigned integer _arr ay _name> EXPR_VAR(<index>); 

d. Case D 

[0079| The <expression> is the unary operation with one argument. The following may 
be defined: EXPR_VAR(<expression>)=ALD_Z. The TRANS(<expression>) may be 
obtained as the following sequence: 

0: TRANS(<argumentl>) 
1: <call_ALD command> 

where the block TRANS(<argumentl>) evaluates the expression <argumentl>, and the 
command <call_ALD_command> may be created as follows: 

a) if the expression <argumentl> is a constant, then the command 
<call_ALD_command> includes 2 subcommands: 

SET CONST ALD O <argwnentl>; 
SET_CONST ALD_OP <unary_operation>; 

b) if the expression <argumentl> is not a constant and 
EXPR_VAR(<argumentl>)=STCK_0, then the command 
<call_ALD_command> includes 3 subcommands: 

SETALD O STCKJ); 
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DROP 1; 

SET CONST ALDOP <unary_pperation>; 
c) if the expression <argumentl> is not a constant and 
EXPR_VAR(<argumentl>)!=STCK_0, then the command 

<call_ALD_command> includes 2 subcommands: 

SETALDJ EXPR_VAR(<argwnentl>); 

SET CONST ALD OP <unary_pperation>; 

e. Case E 

[0080] The <expression> is the binary operation with two arguments. The following may 
be defined: EXPR_VAR(<expression>)=ALD_Z. The TRANS(<expression>) may be 
obtained as the following sequence: 

0: TRANS(<argumentl>) 

1: <savej2rgumentl _command> 

2: TRANS(<argument2>) 

3: <call_ALD_command> 

where the blocks TRANS(<argumentl>) and TRANS(<argument2>) evaluate the 
expressions <argumentl> and <argument2> 5 and the commands 
<save_argumentl_command> and <call_ALD_command> may be created as follows. 

[0081] The command <save_argumentl_command> may be described as follows: 

a) if the expression <argumentl> is a constant, or if 
EXPR_VAR(<argumentl >)=STCK_0, or if the block TRANS(<argument2>) is 
empty, then the command <save_argumentl_command> is empty; 

b) otherwise, the command <save_argumentl_command> includes 1 
subcommand: 

PUT EXPR_ VAR(<argumentl >); 
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[0082| The command <call_ALD_command> includes 4 subcommands: 
<call_ALD_subcommandl>, <call_ALD_subcommand2>, <call_ALD_subcommand3> 
and <call_ALD_subcommand4>. 

[00831 The subcommand <call_ALD_subcommandl> sets the value of the first argument 
of the operation to the variable ALD0: 

a) if the expression <argumentl> is a constant, then it is: 

SET_CONSTALD_0 <argumentl>; 

b) if expression <argumentl> is not a constant, 
EXPR_VAR(<argumentl>)=STCK_0, and 
EXPR_VAR(<argument2>)=STCK_0, then it is: 

SET ALDJ STCKJ; 

c) if expression <argumentl> is not a constant, 
EXPR_VAR(<argumentl>)=STCK_0, and 
EXPR_VAR(<argument2>)!=STCK0, then it is: 

SET ALDJ STCKJ); 

d) if expression <argumentl> is not a constant, 
EXPR_VAR(<argumentl>)!=STCK_0, and the block TRANS (<argument2>) is 
empty, then it is: 

SET ALD J EXPRJVAR(<argumentl>); 

e) if expression <argumentl> is not a constant, 
EXPR_VAR(<argumentl>)!=STCK_0, and the block TRANS(<argument2>) is 
not empty and constant, and EXPR_VAR(<argument2>)=STCK_0, then it is: 

SETALDJ) STCKJ; 

f) if expression <argumentl> is not a constant, 
EXPR_VAR(<argumentl>)!=STCK_0, the block TRANS(<argument2>) is not 
empty and constant, and EXPR_VAR(<argument2>)!=STCK_0, then it is: 

SET AID 0 STCKJ; 
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[0084| The subcommand <call_ALD_subcommand2> sets the value of the second 
argument of the operation to the variable ALD1 : 

a) if expression <argument2> is a constant, then it is: 

SET_CONSTALD_l <argument2>; 

b) if expression <argument2> is not a constant, then it is: 

SETALD1 EXPR_VAR(<argument2>); 

[0085] The subcommand <call_ALD_subcommand3> sets the identifier of the binary 
operation to the variable ALD OP: 

SET CONST ALD OP <binary_operation>; 

[0086] The subcommand <call_ALD_subcommand4> removes the values of the 
operation arguments from the stack EVALSTACK (if these values were stored in this 
stack): 

a) if the subcommand <call_ALD_subcommandl> is "SET ALD_0 
STCKJT, then it is: 

DROP 1; 

b) if the subcommand <call_ALD_subcommandl> is "SET ALDO 
STCK l", then it is: 

DROP 2; 

c) if the subcommand <call_ALD_subcommandl> is not a "SET ALD O 
STCKJT and not a "SET ALD O STCKJ", and the subcommand 
<call_ALD_subcommand2> is "SET ALD 1 STCKO", then it is: 

DROP 1; 

d) otherwise it is empty. 

f. Case F 
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|0087| The <expression> is the call of the C++ function F. Let M be the number of 
arguments of the function F 5 and denote the arguments of this function: a_l, a_2, ... a_M. 
The called function puts the returned value into the stack EVAL_STACK. Thus, the 
following may be defined: EXPR_VAR(<expression>)=STCK_0. 

[0088] The TRANS(<expression>) may be obtained as the following sequence: 

0: TRANS(<expressionl>) 

1: <load argument 1 _command> 

2: TRANS(<expression2>) 

3: <load_argument2_command> 



2M-1: TRANS(<expressionM>) 
2M: <load_argumentM_command> 
2M+1: <calljoommand> 

where the blocks TRANS(<expressionl>), TRANS(<expression2>), 
TRANS(<expressionM>) are created to evaluate the expressions <expressionl>, 
<expression2>, ... <expressionM>. The processor commands 

<loadargumentl_command>, ...<load_argumentM_command>, and <call_command> 
are described below. 

[0089] The command <load_argumentK_command>, where K=1,2,...M, may be 
described as follows: 

a) if expression <expressionK> is a constant, then the command 
<load_argumentK_command> includes 1 subcommand: 

SET_CONSTF::a_K <expressionK> ; 

b) if expression <expressionK> is not a constant and 
EXPR_VAR(<expressionK>)=STCK_0, then the command 
<load_argumentK_command> includes 2 subcommands: 
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SETFr.aJCSTCKJ; DROP 1; 
c) if expression <expressionK> is not a constant and 
EXPR_VAR(<expressionK>)!=STCK_0 5 then the command 

<load_argumentK_command> includes 1 subcommand: 

SET F::a_K EXPR_VAR(<expressionK>); 

[0090] The command <call_command> includes 1 subcommand: 
CALL <addr_F>; 

where the <addr_F> is the address of the first command in the block TRANS(F), the 
block obtained as a result of the translation of the C++ function F. 

[0091] The following examples may be used to illustrate the method of translation of the 
expressions described above. 

g. Example 1 

[0092] Consider the C++ expression that is included in the program ORDER: 
need_exch(j, k) shown in FIG. 1. This expression is a call of function need_exch, so one 
may apply the Case F described above. 

[0093] According to the Case F the block TRANS("need_exch(j, k)") may be presented 
as: 

0: TRANS("j") 

J: <load argument 1 _command> 
2: TRANSC'k") 

3: <load_argument2_command> 
4: <call_command> 

[0094] Both expressions "j" and "k" are variables. Thus, one may apply the Case B for 
each of these expressions, which means that blocks TRANS("j") and TRANS("k") are 
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empty, EXPR_VAR("j")="j" and EXPR_VAR("k")= "k". Then one may apply the case 
c) for definition of the commands <load_argumentl_command> and 
<load_argument2_command>. Therefore, <load_argumentl_command> = SET 
need_exch::posl j, and <load_argument2_command> = SET need_exch::pos2 k. 
Consequently, one may obtain the final block TRANS("need_exch(j, k)"): 

0: SET need _exch::posl j 
1: SET need _exch::pos2 k 

2: CALL <address of first command of block TRANS(need_exch)> 
h. Example 2 

[0095] Now a more complicated example of the C++ expression is considered: 

EXPR = need_exch(j r k) + need_exch(U 3). 

[0096] This expression is the sum (binary operation) of two arguments: need_exch(j i k) 
and need_exch(i f 3). Thus, one may apply the Case E. Consequently, one may present 
the block TRANS (EXPR) as: 

0: TRANS("need_exch(j, k)") 
1: <save_argumentl _command> 
2: TRANS("need_exch(i, 3)") 
3: <call_ALD_command> 

[0097] The block TRANS("need_exch(j, k)") has been considered in the Example 1 
above. The block TRANS("need_exch(i, 3)") may be created like a block 
TRANS("need_exch(j, k)") as follows: 

0: SET need _exch::posl i 

1: SET CONST need_exch::pos2 3 

2: CALL <address of first command of block TRANS(need_exch)> 
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[0098] Both EXPR_VAR("need_exchG, k)") and EXPR_VAR("need_exch(i, 3)") are 
equal to STCKJ). Thus, one may apply the case a) for definition of the command 
<save_argumentl_command>. Thus, <save_argumentl_command> is empty. The 
command <call_ALD_command> includes 4 subcommands. For the subcommand 
<call_ALD_subcommandl> the case b) is applied for definition, for the subcommand 
<call_ALD_subcommand2> the case b) is applied for definition, and for the 
subcommand <call_ALD_subcommand4> the case b) is also applied for definition. 
Taking all into account, the final block TRANS(EXPR) may be written as follows: 

0: SET need_exch::posl j 
1: SET need _exch: :pos2 k 

2: CALL <address of first command of block TRANS(need_exch)> 

3: SET need_exch::posl i 

4: SET _CONST need _exch::pos2 3 

5: CALL <address of first command of block TRANS(need_exch)> 

6: SET AID J) STCKJ; SET AID J STCKJ; SET_CONST ALD_OP 

"+ "; DROP 2; 

(4) Translation of C++ commands 

a. Return-operator 

[0099] The return-operator can be of one of two following types: 1 A and IB. 

IA. Return-operator inside the void functions: return; 

[00100] The result TRANS(<return-operator>) of the translation of this operator is one 
processor command: 

RETURN; 

IB. Return-operator inside the functions that returns unsigned 
integer value: return <expression>; 
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[001011 The values returned by unsigned integer C++ functions are placed in the stack 
EVALSTACK. The block TRANS(<return-operator>) may be created as the following: 

0: TRANS(<expression>) 
1: <put_value_command> 

where the command <put_value_command> includes 2 subcommands: 
<put_value_subcommandl> and the subcommand RETURN. 

[00102] The subcommand <put_value_subcommandl> may be created according to the 
following rules: 

a) if the expression <expression> is a constant, then the subcommand 
<put value_subcommandl> is: 

PUT_CONST <expression>; 

b) if the expression <expression> is not a constant and 
EXPR_VAR(<expression>)==STCK_0, then the subcommand 
<putvalue_subcommandl> is empty; 

c) if the expression <expression> is not a constant and 
EXPR_VAR(<expression>)!=STCK_0 ? then the subcommand 
<put_value_subcommandl> is: 

PUT EXPR_ VAR(<expression > ); 

[00103] Now the return-operator return (y[posl]<y[pos2]); from the program ORDER 
shown in FIG. 1 is considered. One may apply the type IB. One may obtain that 
EXPR_YAR(y[poslJ<y[pos2D =ALD Z, and the block TRANS(y[posl]<y[pos2D 
is: 

0: READyposl; 

l:PUTy; 

2: READy pos2; 
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3: SET ALDJ STCKJ; SET ALDJ y; SET CONST ALD OP "<"; 
DROP 1; 

[00104] Thus in order to construct the block TRANS("return (y[posl]<y[pos2])f\ the 
following command may be appended: 

4: PUT ALD_Z; RETURN; 

to the end of the block TRANS("y[posJ]<y[pos2]"). 

b. Set-operator 
[00105] The set-operator can be of one of two following types: 2A and 2B. 

2A. <variable_name> = <expression> 

[00106] The block TRANS(<set-operator>) is: 

0: TRANS(<expression>) 
1: <set_command> 

where the command <set_command> may be created according to the following rules: 

a) if the expression <expression> is a constant, then the command 
<set_command> is: 

SET CONST <variable_name> <expression> ; 

b) if the expression <expression> is not a constant and 
EXPR_VAR(<expression>)==STCK_0, then the command <set_command> is: 

SET <variable_name> STCKJ); DROP 1; 

c) if the expression <expression> is not a constant and 
EXPR_VAR(<expression>)!=STCK_0, then the subcommand <set_command> 
is: 
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SET <variable_name> EXPR_VAR(<expression>); 

[00107] Now the se/-operator k=j+l from the program ORDER shown in FIG. 1 is 
considered. Applying the method of the translation of the expressions, one may obtain 
that EXPR_VAROy+/") ==ALD_Z, and the block TRANS("j+l ") is 

0: SET AID _0 j; SETJOONST ALDJ 1; SETJOONST ALD_OP "+"; 

[00108] Thus, the required block TRANS("£=/+7") may be obtained according to rule c) 
above as follows: 

0: SET ALDJ j; SET JONST ALDJ 1; SET_CONSTALD_OP"+"; 
1 : SET k ALDJ; 

2B. <unsignedJnteger_array>[<indexexpression>] = <expression> 

[00109] The block TRANS(<set-operator>) is: 

0: TRANS(<expression>) 
1: <savejBxpressionjcommand> 
2: TRANS(<indexjexpression>) 
3: <set_command> 

[00110] The command <save expression_command> may be obtained according to the 
following rules: 

a) if the block TRANS(<expression>) is empty, the block 
TRANS(<index_expression>) is empty, or 
EXPR_VAR(<expression>)==STCK_0, then the command 
<save_expression_command> is empty. 

b) otherwise, the command <save_expression_command> is: 

PUT EXPRJAR(<expression>); 
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(001111 The command <set__command> includes 2 subcommands: <set_subcommandl> 
and <set_subcommand2> obtained according to the following rules: 

a) if the expression <expression> is a constant, the following cases aa), ab), and 
ac) are considered: 

aa) if the expression <index_expression> is a constant then the following 
may be assigned: 

<set_subcommandl>: WRITE CONST CONST 

<unsignedjntegerjxrray> <expression> <index_expression> ; 
<set_subcommand2>: empty. 

ab) if the expression <index_expression> is not a constant and 
EXPR_VAR(<index_expression>)==STCK_0, then one may assign: 

<set_subcommandl>: WRITE _CONST_VAR 

<unsigned_integer_array> <expression> STCKJ); 
<set_subcommand2>: DROP L 

ac) if the expression <index_expression> is not a constant and 
EXPR_VAR(<index expression>)!=STCK_0, then one may assign: 

<set_subcommandl>: WRITE _CONST_VAR 
< unsigned Jnteger_array> <expression> 
EXPR_ VAR( < index _expression>); 

<set_subcommand2>: empty; 

b) if the expression <expression> is not a constant and at least one of the 
following two conditions is valid: 

- EXPR_VAR(<expression>)=STCK_0 

- the command <save_expression_command> is not empty; 
then one may consider cases ba), bb), and be) as follows: 

ba) if the expression <index_expression> is a constant then one may 
assign: 
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<set_subcommandl>: WRITE _VARJC0N ST 
<unsigned_integer_array> STCKO <index_expression>; 
<set_subcommand2>: DROP 1. 
bb) if the expression <index_expression> is not a constant and 
EXPR_VAR(<index_expression>)==STCK_0, then one may assign: 

<set_subcommandl>: WRITE _VAR_VAR 

<unsigned_integer_array> STCK l STCK O; 
<set_subcommand2>: DROP 2, 
be) if the expression <index_expression> is not a constant and 
EXPR_VAR(<index_expression>)!=STCK_0, then one may assign: 

<set_subcommandl>: WRITE _ VAR_ VAR 
<unsigned_integer_array> STCKO 
EXPR_ VAR( < index _expression>); 

<set_subcommand2>: DROP 1; 

c) if the expression <expression> is not a constant, 
EXPR_VAR(<expression>)!=STCK_0 ? and the command 

<save_expression_command> is empty, then the following cases ca), cb), and cc) 
may be considered: 

ca) if the expression <index_expression> is a constant, then one may 
assign: 

<set_subcommandl>: WRITE _VAR_CONST 

<unsignedjntegerj2rray> EXPR_VAR(<expression>) 
< index _expression> ; 

<set_subcommand2> empty. 

cb) if the expression <index_expression> is not a constant and 
EXPR_VAR(<index_expression>)=STCK_0, then one may assign: 

<set_subcommandl>: WRITE _VAR_VAR 
<unsigned_integer_array> EXPR_VAR(<expression>) STCK O; 
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<set_subcommand2>: DROP L 
cc) if the expression <index_expression> is not a constant and 
EXPRJVAR(<index_expression>)!=STCK_0, then one may assign: 

<set_subcommandl>: WRITE _ VAR_ VAR 
< unsigned Jnteger_array> EXPR_ VAR ( < expression>) 

EXPR_ VAR(< index _expression>); 

<set_subcommand2>: empty; 

[00112] Now the sef-operator y[i]=x[i] in the program ORDER shown in FIG. 1 is 
considered. In this example, the following are true: <expression>="x[i]", 
<index_expression>="i", and <unsigned_integer_array>- y\ So long as the block 
TRANS(<index_expression>) is empty, the rule "a" may be applied for definition of the 
command <saveexpression_command>. Thus, this command is empty. 
EXPR_VAR("i")= "i", EXPR_VAR("x[i]")= V. One may apply the rule "cc" for the 
definition of the command <set_command>. Thus, the block TRANS("y[i]=x[i]") is: 

0: READ x i; // TRANS( u x[i]") 

1 : WRITE J/AR_VAR y x i; // set_command 

c. Call-operator 

[00113] The C++ program allows calling any void C++ function. The call of this function 
may be considered as the evaluation of the expression of the Case F above. Thus, the 
block TRANS(<call-operator>) may be defined as: 

TRANS(<call-operator expression>). 

d. If-operator 

[00114] The //-operator is the following operator 

if (<expression>) <positive_operator> 
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else <negativej)perator> 

where operators <positive_operator> and <negative_operator> are allowed to be empty. 

[00115] To create the block TRANS(<if-operator>), the following cases may be 
considered: 

a) if the expression <expression> is a constant equal to zero, then the 
block TRANS(<if-operator>) is the block TRANS(<negative_operator>); 

b) if the expression <expression> is a constant that is not equal to zero, 
then the block TRANS(<if-operator>) is the block TRANS(<positive_operator>); 

c) if the expression <expression> is not a constant and both 
TRANS(<negative_operator>) and TRANS(<positive_operator>) are empty 
blocks, then the block TRANS(<if-operator>) is empty; 

d) if the expression <expression> is not a constant, the 
TRANS(<negative_operator>) is an empty block, and the 
TRANS(<positive_operator>) is not an empty block, then the block TRANS(<if- 
operator>) is: 

0: TRANS(<expression>) 

1: <compare Jump _command> 

2: TRANS(<positive_operator>) 
where the command <compareJump command> includes 2 subcommands. If 
EXPR_VAR(<expression>)=STCK_0, then the first subcommand is DROP 1, 
otherwise the first subcommand is empty. The second subcommand is: 

ZERO JUMP EXPR_VAR(<expression>) <address> 
where <address> is the address of the first command that situates after the block 
TRANS(<positive_operator>). 
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e) if the expression <expression> is not a constant, the 
TRANS(<negative_operator>) is not an empty block, and the 
TRANS(<positive_operator>) is an empty block, then the block TRANS(<if- 
operator>) is: 

0: TRANS(<expression>) 

1: < compare Jump_command> 

2: TRANS(<negative_operator>) 
where the command <compareJump_command> includes 2 subcommands. If 
EXPR_VAR(<expression>)=STCK_0, then the first subcommand is DROP 7, 
otherwise the first subcommand is empty. The second subcommand is: 

NONZERO JUMP EXPRVAR(<expression>) <address> 
where <address> is the address of the first command that situates after the block 
TRANS(<negative_operator>). 

f) if the expression <expression> is not a constant and both 
TRANS (<negative_operator>) and TRANS(<positive_operator>) are not empty 
blocks, then the block TRANS(<if-operator>) is; 

0: TRANS(<expression>) 

1: < compare Jump command> 

2: TRANS(<positive_operator>) 

3: JUMP <address of the first command after the block 
TRANS(negative_operator)> 

4: TRANS(<negative_operator>) 
where the command <compareJump_command> includes 2 subcommands. If 
EXPR_VAR(<expression>)=STCK_0, then the first subcommand is DROP 7, 
otherwise the first subcommand is empty. The second subcommand is: 

ZERO JUMP EXPR_VAR(<expression>) <address> 
where <address> is the address of the first command of the block 
TRANS(<negative_operator>). 
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e. For-operator 
[00116] The for-operator is as follows: 

for (<at_start>;<expression>;<atJteration Jinish>) <body> 

where <at_start> and <at_iteration_finish> are the se/-operators or empty, and <body> is 
any allowed C++ operator. 

[00117] To create the block TRANS(<for-operator>), the following cases may be 
considered: 

a) if the expression <expression> is a constant equal to zero, then the 
block TRANS(<for-operator>) is empty; 

b) if the expression <expression> is a constant that is not equal to zero, 
then the block TRANS(<for-operator>) is: 

0: TRANS(<at_start>); 
1: BODY^BLOCK; 
where the block BODY BLOCK is: 
0: TRANS(<body>); 
1: TRANS(<at_iteration Jinish>); 

2: JUMP <address of first command of block BODY _BLOCK> 

c) if the expression <expression> not a constant, then the block 
TRANS(<for-operator>) is: 

0: TRANS(<at_start>); 
1: TRANS(<expression>); 
2: <check_condition_command> 
3: TRANS(<body>); 
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4: TRANS(<atJteration _fmish>) 

5: JUMP <address of first command of block 
TRANS( <expression>) +<check_condition_command> > 
where the command <check_condition_command> includes 2 subcommands. If 
EXPR_VAR(<expression>)=STCK_0, then the first subcommand is DROP 7, 
otherwise the first subcommand is empty. The second subcommand is: 

ZERO JUMP EXPR_VAR(<expression>) <address> 
where <address> is the address of the first command that situates after the 
obtained block TRANS(<for-operator>). 

[00118] In practice, most of the /or-operators are such that the operator <at_start> 
initializes a variable (for example, i=0), the operator <at_iteration_finish> is the 
increment or decrement operator (i=i+l, or i=i-l), and the expression <expression> 
compares the value of a variable with a constant (i<10, i>=9, etc.). 

[00119] The 4 subcommands called LOOP* in the processor language are used for 
optimization of the ^br-operators of the considered types. These commands are capable 
of doing 4 actions simultaneously: 

- TRANS(<expression>) 

- <check_condition_command> 

- TRANS(<at_iteration Jinish>) 

JUMP <address of first command of block 
TRANS ( <expression>) + < check condition _command> > 

[00120] The following two examples may be used to demonstrate how to use the 
subcommands LOOP* instead of the rule c) described above 

[00121] First, consider the ^r-command from the program ORDER shown in FIG. 1 : 
for (i=0;i<10;i=i+l)y[i]=x[i]; 
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Instead of creating the block according to the rule c) described above, the block 
TRANS(<for-operator>) may be created as: 

0: TRANS("i=0") 

1: TRANS("y[i]=x[i]") 

2: LOOPJNCJVOMORE i 8 <address of first command of block 
TRANS("y[i]=x[i]")> 

[001221 Now consider the following example of a ^or-command: 
for (i=9;i>=0;i=i-l) y[ij=xfij; 

Instead of creating the block according to the rule c) described above, the block 
TRANS(<for-operator>) may be created as: 

0: TRANS("i=9") 

1: TRANS("yfiJ=xfiJ") 

2: LOOP_DEC_NOLESS i 1 <address of first command of block 
TRANS("y[i]=x[i]")> 

f. Begin-end-operator 

[00123| Each begin-end-operator may be translated as the concatenation of the blocks 
created for each of operators included inside this begin-end-operator. The block 
TRANS(<begin-end-operator>) is the following block: 

0: TRANS(<operator_l>) 
1: TRANS(<operator_2>) 



N: TRANS(<operator_N>) 
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g. Break-operator 

[00124] The break-operator: break; is placed inside the while and for operators to finish a 
cycle. Thus, the block TRANS(<break-operator>) may include one processor command: 

JUMP <address of first command after the cycle> 

(5) Result 

[00125] FIG. 2 shows a result after the C++ program "ORDER" shown in FIG. 1 is 
translated into a parallel processor program in accordance with an exemplary 
embodiment of the method described above. Note that the staring address ("18") in FIG. 
2 is the address of the function "main". 

[00126] It is noteworthy that the method described in the foregoing Part C may translate a 
C++ programs into a parallel processor language with a minimum processor program 
length, but not necessarily with a minimum execution time. Part D below will describe a 
method (e.g., some powerful algorithms) to optimize execution time of a processor 
program. 

D. Optimization of execution time of parallel processor programs 

[00127] A processor program loaded to a processor may be written in two different ways: 
(1) the program may be first written in some high-level programming language like C++, 
Pascal, etc., and the program may be then translated into the processor language; and (2) 
the program may be written in a processor language manually. In either case, it would be 
desirable to provide a method (e.g., some algorithms, and the like) that allows the 
execution time of a processor program to be optimized (e.g., minimized). 

[00128] For example, commands 0-5 in FIG. 2 correspond to the C++ function "exch" 
shown in FIG. 1. It is easy to understand that the evaluation time of the function "exch" 
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is equal to 6 clock cycles (the evaluation time of 6 processor commands with addresses 0- 
5). However, the function "exch" may be optimized as follows: 

0: READ y exch: :pos 1 ; // function "exch" 

1: SET exch::tmp y; READ y exch::pos2; 

2: WRITE yyposl; 

3: WRITE y exch::tmp pos2; RETURN; 

where it takes only 4 clock cycles to evaluate the "exch" function. This is the minimal 
possible execution time because during execution of the function "exch" at least 4 
reading/writing operations with the array "y" are required: reading the values y[posl], 
y[pos2], and later writing new values to these elements of the array y. Because each of 
these operations takes 1 clock cycle to perform, at least 4 clock cycles are required to 
carry out the function "exch". 

(1) Method of execution time optimization 

[00129] Denote a set OBJ that contains the following objects: 

1) all variables and arrays that are used in the processor program; 

2) stack EVALSTACK; 

3) variables ALD0, ALD1, ALD_OP, ALD Z. 

[00130] For any variable <var>, denote the subset ACCESS JLIST(<var>) of the set OBJ 
as follows: 

a) If <var>=ALD_Z, then ACCESS_LIST(<var>) = {ALD_0, ALD1, 
ALD_OP}. 

b) If <var>=STCK_0, STCKJ, e.t.c, then ACCESS_LIST(<var>) = 
{EVALSTACK} . 

c) Otherwise, ACCESS_LIST(<var>) = {<var>}. 
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[00131] For each subcommand <subcommand>, denote 2 subsets 
ACCESS(<subcommand>) and LOCK(<subcommand>) of the set OBJ described above 
as follows: 

1A. Subcommand FIN. LOCK = empty. ACCESS = empty. 

2 A. Subcommand SET <variable_name_l> <variable_name_2> . LOCK 
= {<variable_name_l>}. ACCESS = ACCESS_LIST(<variable_name_2>). 

2B. Subcommand SETCONST <variable_namej> 

<unsigned_integer_constant>. LOCK = {<variable_name_l>}. ACCESS = 
empty. 

2C. READ <array_name> <variable_name> . LOCK = {<array_name>}. 
ACCESS - ACCESS_LIST(<variable_name>). 

2D. READCONST <array_name> <unsigned_integerjconstant>. 
LOCK = {<array_name>}. ACCESS - empty. 

2F. WRITE _ VARVAR <array_name> <variable_name> 
<index variable _name> . LOCK = {<array_name>}. ACCESS 
ACCESS_LIST(<variable_name>) U ACCESS_LIST(<index_variable_name>). 
(The symbol "U" means theoretical union of two sets). 

2G. WRITE VARJOONST <array_name> <variablejiame> 
<index_constant>. LOCK = {<array_name>}. ACCESS 
ACCESS_LIST(<variable_name>). 

2H. WRITE CONST _ VAR <arrayjiame> <constant> 

<index_variable_name> . LOCK = {<array_name>}. ACCESS 
ACCESS_LIST(<index_variable_name>). 

21. WRITE CONST CONST <array_name> <constant> 
<index constant> . LOCK = {<array_name>}. ACCESS = empty. 

4A. PUT <variable_name>. LOCK - {EVAL STACK} . ACCESS = 
ACCESS_LIST(<variable_name>). 

4B. PUTCONST <unsigned_integer_constant>. LOCK 
{EVAL STACK} . ACCESS = empty. 
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AC. DROP <unsigned_integer_constant>. LOCK = {EVALSTACK} 
ACCESS = empty. 

5A. CALL <address>. LOCK = empty. ACCESS = empty. 

5B. RETURN. LOCK = empty. ACCESS = empty. 

6A. JUMP <address>. LOCK = empty. ACCESS = empty. 

6B. ZERO JUMP <variable_name> <address>. LOCK = empty 
ACCESS = ACCESS_LIST(<variable_name>). 

6C. NONZEROJUMP <variable_name> <address>. LOCK = empty 
ACCESS = ACCESS_LIST(<variable _name>). 

6D. LOOPJNCNOLESS <variable_name> 
<address>. LOCK = {<variable_name>}. 
ACCESS_LIST(<variable_name>). 

6E. LOOP_INC_NOMORE <variablejiame> 
<address>. LOCK = {<variable_name>}. 
ACCESS_LIST(<variable_name>). 

6F. LOOP_DEC_NOLESS <variable_name> 
<address>. LOCK = {<variable_name>}. 
ACCESS_LIST(<variable_name>). 

6G. LOOPDECNOMORE <variable_name> 
<address>. LOCK = {<variable_name>}. 
ACCESS_LIST(<variable_name>). 



<constant_yalue> 
ACCESS 

<constant_value> 
ACCESS 

<constant_value> 
ACCESS 

<constant_value> 
ACCESS 



[00132| The defined sets LOCK(<subcommand>) and ACCESS(<subcommand>) are very 
important for the present method of optimization. The set LOCK(<subcommand>) 
includes a list of objects OBJ that are used by this subcommand in an exclusive mode. It 
is not allowed to place two different subcommands Subcommand 1> and 
<subcommand2> in the same command of a processor program if the sets 
LOCK(<subcommandl>) and LOCK(<subcommand2>) intersect (or have a common 
element). 
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[00133] The set ACCESS(<subcommand>) includes objects whose values are used by this 
subcommand but not changed by this subcommand. It is allowed to place two or more 
subcommands in the same command if these subcommands have intersected sets 
ACCESS. 

[00134] Let the subcommands Subcommand 1> and <subcommand2> be such that the 
sets ACCESS(<subcommandl>) and LOCK(<subcommand2>) are intersected. Then it 
is allowed to place these subcommands in the same command of a processor program if 
and only if the subcommand Subcommand 1> is placed before the subcommand 
<subcommand2>. For example, in the command 1 of last example (optimized function 
"exch"), there are 2 subcommands: "SET exch::tmp y" and "READ y exch::pos2". The 
array "y" belongs to both sets ACCESS("SET exchntmp y") and LOCK("READ y 
exch::pos2"). Nevertheless it is allowed because the subcommand "SET exch::tmp y" is 
placed before the subcommand "READ y exch::pos2". 

[00135] A command <command> is a jump-target command if there is a jump- 
subcommand with the address equal to the address of the command <command>. For 
example, in the processor program ORDER shown in FIG. 2, there are several jump- 
target commands: 0 (command 34 contains a jump-subcommand with address 0), 6 
(command 30 contains a jump-subcommand with address 6 ), 23 (command 37 contains a 
jump-subcommand with address 23), and the like. 

[00136] A command <command2> does not depend from a command <commandl> if and 
only if for each subcommand pair (Subcommand 1> of the command <commandl>, and 
<subcommand2> of the command <command2>), the following 2 conditions are met: 

1) the sets LOCK(<subcommandl>) and LOCK(<subcommand2>) are not 
intersected; and 



45 



03-0929 



2) the sets LOCK(<subcommandl>) and ACCESS(<subcommand2>) are 
not intersected. 

[00137] For each command <command>, SIZE(<command>) may be denoted as the 
number of subcommands included in the command <command>. For each command 
<command>, the following is true: SIZE(<command>) <= N, where N is the maximal 
number of subcommands inside one processor command. 

[00138] FIG. 3 is a flow diagram showing an exemplary method 300 for optimizing 
execution time of a parallel processor program in accordance with an exemplary 
embodiment of the present invention. The method 300 may start with step 302 in which 
a parallel processor program is received. The parallel processor program may be written 
in the processor language described in Part B of the present application. Dummy jumps 
optimization is performed 304. Linear code optimization is performed 306. Jumps 
optimization is performed 308. An inquiry is held in step 310 to see if there is any 
change made in the steps 304 - 308. If the answer is yes, the method goes back to the 
step 304. If the answer is no, then the resulted processor program is output 312. 

[00139] The steps 304 - 308 are each described in detail below. 

(2) Dummy jumps optimization 

[00140] FIG. 4 is a flow diagram showing an exemplary embodiment of the step 304 of 
performing dummy jumps optimization shown in FIG. 3 in accordance with an 
exemplary embodiment of the present invention. As shown, the step 304 may start with 
step 402 in which transition of jumps is applied 402. In the step 402, commands of the 
processor program (command 0, command 1, command 2, and the like) are examined. 
For each command <command>, all jump-subcommands of the current command are 
examined. For each jump-subcommand <subcommand>, an inquiry is held to check if 
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the jump-target command of the subcommand <subcommand> only includes 
subcommand: JUMP <new_address>. If the answer is yes, the jump address of the 
command <subcommand> is replaced with the new address <new_address>. The 
following is an example: 

<subcommand>: ZERO JUMP <var> <addr> 



addr: JUMP <new_addr> 

When the above example is met in the program, the address <addr> in the subcommand 
is replaced with the new address <new_addr>, and the new <subcommand> may be 
obtained as follows: 

ZERO JUMP <var> <new_addr> 

[00141] Then, unreachable jumps are removed 404. These unreachable jumps (jumps that 
can never be reached) may appear after the step 402. In the step 404, commands of the 
processor program (command 0, command 1, command 2, and the like) are examined. 
For each command <command>, if the command <command> includes only 
subcommand JUMP, if the command <command> is not a jump-target, and if the 
previous command has subcommand JUMP (where the previous command is the 
command with an address less than the address of the current command <command> by 
1), then the current command <command> is removed. 

[00142] Next, dummy jumps are removed 406. In the step 406, commands of the 
processor program (command 0, command 1, command 2, and the like) are examined. 
For each command <command>, if the command <command> has address ADDR and 
has the subcommand JUMP (ADDR+1), then this subcommand is removed. If this 
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subcommand is the only subcommand of the command <command>, then the command 
<command> is removed. 

(3) Linear code optimization 

[00143] The linear code optimization deals with processor program domains that contain 
the non-jump-subcommands only. The domain is a command or several neighboring 
commands of a processor program. The command(s) included in a domain may have 
addresses ADDR, ADDR+1, (ADDR+M-1), where the ADDR is the address of the 
first command in the domain, and the M is the number of commands in the domain (or 
the length of the domain). A domain is a linear domain if an only if the following 2 
conditions are met: 

- the domain includes non-jump-subcommands only; 

- all the commands of the domain other than the first command are not 
jump-target commands. 

[00144] For example, here is a list of linear domains in the processor program ORDER 
shown in FIG. 2: commands 0-4 (first address ADDR=0, length M=5), command 6 
(ADDR=6, M=l), commands 8-11 (ADDR=8, M=4), commands 13-16 (ADDR=13, 
M=4), command 18 (ADDR=18, M=l), commands 19-20 (ADDR=19, M=2), command 
22 (ADDR=22, M=l), command 23 (ADDR=23, M=l), command 24 (ADDR=24, M=l), 
commands 26-29 (ADDR=26, M=4), commands 32-33 (ADDR=32, M=2), and 
command 35 (ADDR=35, M=l). 

[00145] FIG. 5 is a flow diagram showing an exemplary embodiment of the step 306 of 
performing linear code optimization shown in FIG. 3 in accordance with an exemplary 
embodiment of the present invention. The linear code optimization deals with the linear 
domains only. It takes one linear domain (called original domain), optimizes it, and 
replaces the original domain with the optimized one in the processor program. In a 
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preferred embodiment, all the linear domains with length M > 1 are examined. The step 
306 may be applied to each linear domain <domain> with the first address ADDR and the 
sizeM > 1. 

[00146] As shown in FIG. 5, the step 306 may start with step 502 in which all commands 
included in the domain <domain> starting from the second command are examined. That 
is, commands with addresses ADDR+1, ADDR+M-1 may be examined. For each 
examined command <command> with address K (where K=ADDR+1, ADDR+M-1), 
all subcommands of the command <command> are examined. For each subcommand 
<subcommand>, the following steps may be performed: 

a) evaluate the sets LOCK(<subcommand>) and 
ACCESS(<subcommand>); 

b) find such a maximal address TLOCK that K>T_LOCK>=ADDR and 
the command with address TLOCK includes a subcommand <subcommand2> so 
that the set LOCK(<subcommand2>) intersects with the set 
LOCK(<subcommand>). If there is no such address T LOCK, assign 
T LOCK— 1; 

c) find such a maximal address TLOCKACCESS that 
K>T_LOCK_ACCESS>=ADDR and the command with address 
TLOCKACCESS includes a subcommand <subcommand2> so that the set 
ACCESS(<subcommand2>) intersects with the set LOCK(<subcommand>). If 
there is no such address TLOCKACCESS, assign T_LOCK_ACCESS=-l; 

d) find such a maximal address TACCESSLOCK that 
K>T_ACCESS_LOCK>=ADDR and the command with address 
T_ACCESS_LOCK includes a subcommand <subcommand2> so that the set 
LOCK(<subcommand2>) intersects with the set ACCESS(<subcommand>). If 
there is no such address TACCESSLOCK, assign T_ACCESS_LOCK=-l; 
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e) define address T=max(ADDR, TLOCK+l, TJLOCK_ACCESS, 
T_ACCESS_LOCK+l). The address T may take values K>=T>=ADDR. 

f) if T==K, then go to step i) below, otherwise go to step g) below. 

g) if SIZE(command with address T)<N, then the subcommand 
<subcommand> is appended to the command with the address T, and the 
subcommand <subcommand> is removed from the command <command> (that is 
the command with address K). Then go the step i) below. 

h) set T=T+1, and go back to the step f) above. 

i) finish processing the subcommand <subcommand>. 

[00147] Next, all empty commands are removed from the domain <domain> 504. After 
the step 502, some commands in the domain <domain> may become empty (because all 
the subcommands of these commands may be moved up to the previous commands). 

[00148] The following is an example: consider the domain containing commands 0-4 of 
the processor program ORDER shown in FIG. 2: 

0: READ y exch::posl; 

1: SET exch::tmp y; 

2: READ y exch::pos2; 

3: WRITE yyposl; 

4: WRITE y exch::tmp pos2; 

[00149] The command with K=l (i.e., command 1) may be first examined, which has one 
subcommand "SET exch::tmp y". Applying the steps a) - i) of the step 502 described 
above to this subcommand, the following may be obtained: 

LOCK("SET exch::tmp y")={exch::tmp}; 
ACCESS("SET exch::tmp y")={y}; 
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T_LOCK=-l (there is no command in the domain before the current 
examined command that locks the variable "exch::tmp"); 

T_LOCK_ACCESS=-l (there is no command in the domain before the 
current examined command that accesses the variable "exchntmp"); 

T_ACCESS_LOCK=0 (there is a command 0 that has a subcommand that 
locks the array "y"); 

T=l; 

So long as T— K, nothing need be done with the subcommand "SET exch::tmp y". 

[00150] The command with K=2 (i.e., command 2) is now examined, which has one 
subcommand "READ y exch::pos2". Applying the steps a) - i) of the step 502 described 
above to this subcommand, the following may be obtained: 

LOCK("READ y exch::pos2")={y}; 
ACCESS("READ y exch::pos2")={exch::pos2}; 

T_LOCK=0 (there is a command 0 that has a subcommand that locks the 
variable "y"); 

T_LOCK_ACCESS=l (there is a command 1 in the domain before the 
current examined command that accesses the variable "y"); 

T_ACCESS_LOCK=-l (there is no command in the domain before the 
current examined command that accesses the variable "exch::pos2"); 

T=l; 

So long as the size of the command with address T is equal to 1 < N, the subcommand 
"READ y exch::pos2" is moved from the command K=2 to the command T=l. Thus, the 
following may be obtained: 

0: READ y exch::posl; 



51 



03-0929 



1: SET exch::tmp y; READ y exch::pos2; 
2: 

3: WRITE yyposl; 

4: WRITE y exch::tmp pos2; 

[00151] Then the command 3 is examined, and subcommand "WRITE y y posl" is moved 
from the command 3 to the command 2. Then the command 4 is examined, and the 
subcommand "WRITE y exch::tmp pos2" is moved from the command 4 to the command 
3. After all these, the following may be obtained: 

0: READ y exch::posl; 

1: SET exch::tmp y; READ y exch::pos2; 

2: WRITE yyposl; 

3: WRITE y exch::tmp pos2; 

4: 

In the step 504, the empty command 4 is removed. 

(4) Jumps optimization 

[00152] The jumps optimization deals with optimization of commands containing some 
jump-subcommands and may be performed in the step 308 shown in FIG. 3. In the step 
308, commands of the processor program (command 0, command 1, command 2, and the 
like) are examined. For each command <command>, the following scenarios A-D may 
be considered. 

a. Scenario A 
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(00153] If the previous command has no jump-subcommand, the command <command> 
has jump-subcommand(s) and does not depend from the previous command, and 
SIZE(previous command)+SIZE(<command>) <= N, then the following cases a) and b) 
are considered: 

a) If the command <command> is not a jump-target command, then the 
previous command is replaced with the union of the previous command and the 
command <command>, and the command <command> is later removed. 
Throughout the present invention, the term "union of 2 commands" means a new 
command that is created by taking all the subcommands of the first command and 
all the subcommands of the second command. 

b) If the command <command> is a jump-target command, and the 
command <command> contains jump-subcommand JUMP or jump-subcommand 
RETURN, then the previous command is replaced with the union of the previous 
command and the command <command> 

[00154] For example, the following are the commands 35 and 36 of the processor program 
ORDER shown in FIG. 2: 

35: SETjk; 
36: JUMP 24; 

The current command <command> is the command 36, and the previous command is the 
command 35. Since all the conditions of the case a) are met, the command 35 is replaced 
with the union of the commands 35 and 36, and the command 36 is removed. Thus, the 
following may be obtained: 

35: SETjk; JUMP 24; 
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b. Scenario B 

[00155] If the previous command has jump-subcommand(s) but has no jump- 
subcommands of types CALL, RETURN and JUMP (consequently, it has jump- 
subcommands of types LOOP*, ZEROJUMP and NONZERO_JUMP only), and the 
command <command> does not depend from the previous command, then consider the 
cases a), b) and c) as follows: 

a) If the command <command> is not a jump-target command, and 
SIZE(previous command)+SIZE(<command>) <= N, then the previous command 
is replaced with the union of the previous command and the command 
<command>, and later the command <command> is removed; 

b) If the command <command> is a jump-target command and contains 
jump-subcommand JUMP or jump-subcommand RETURN, and SIZE(previous 
command)+SIZE(<command>) <= N, then the previous command is replaced 
with the union of the previous command and the command <command>; 

c) If the command <command> is a jump-target command, the case b) 
above is not applicable, and SIZE(previous command)+SIZE(<command>) < N, 
then the previous command is replaced with the union of the previous command 
and the command <command>, and later the subcommand JUMP <ADDR+1> is 
appended to the end of the new previous command, where ADDR is the address 
of the command <command>. 

[00156] For example, the following are the commands 7 and 8 of the processor program 
ORDER shown in FIG. 2: 

7: ZERO_JUMP ALD Z 13; 
8: READ y need_exch::posl; 
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The current command <command> is the command 8, and the previous command is the 
command 7. Since all the conditions of the case a) are met, the command 7 is replaced 
with union of the commands 7 and 8, and the command 8 is removed. Thus, the 
following may be obtained: 

7: ZEROJUMP ALD Z 13; READ y need_exch::posl; 

[00157] The following is another example: 

0: ZERO_JUMP x 2; 
l:ZERO_JUMPy 3; 
2: SETyx; 
3: FIN; 

Suppose the command with address 2 is the command <command>. The previous 
command is the command with the address 1, and the ADDR=2. The command 2 is a 
jump-target command because the command 0 has a jump-subcommand with address 2. 
Thus, the cases a) and b) are not applicable here and the case c) is applied. Thus, the 
following may be obtained: 

0: ZERO JUMP x 2; 

1 : ZERO JUMP y 3; SET y x; JUMP 3; 

2: SETyx; 

3: FIN; 

Note that the new previous command (the command with the address 1) is obtained as the 
union of old commands 1 and 2 and the subcommand JUMP 3. This transformation of 
the program may reduce the program execution time from 4 clock cycles to 3 clock 
cycles when x!=0 and y!=0. 
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c. Scenario C 

[00158] If the command <command> has a jump-subcommand <subcommand> of the 
type JUMP or CALL, the jump-target command of this subcommand has no jump- 
subcommand and does not depend from the command <command>, and 
SIZE(<command>)+SIZE(jump-target command)<=N, then all the subcommands of the 
jump-target command are inserted to the command <command> before the jump- 
subcommand <subcommand>, and the jump-address in the subcommand <subcommand> 
is increased by 1 . 

[00159] For example, consider the command 30 of the program ORDER shown in FIG. 2 
as a command <command>. It has the jump-subcommand CALL 6. The jump-target 
command is the command with address 6. Since all the conditions of Scenario C are met, 
all the subcommands of the command 6 are inserted to the command 30 before 
subcommand CALL 6, and the address 6 is replaced with the address (6+1) in the 
subcommand CALL 6. The obtained new command 30 is shows as follows: 

30: SET ALD 0 dir; SET CONST ALD1 0; SET CONST ALD OP 
"=="; CALL 7; 

d. Scenario D 

[00160] If the command <command> has a jump-subcommand <subcommand> of the 
type JUMP, and the jump-target command of this subcommand has some jump- 
subcommands and does not depend from the command <command>, then consider the 
following cases a), and b): 
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a) If the jump-target command has only one jump-subcommand 
<subcommand2> of types JUMP or CALL and has no more jump-subcommands 
(and it may have some non-jump-subcommands), and 
SIZE(<command>)+SIZE(jump-target command)<=(N+l), then all the 
subcommands of the jump-target command are inserted into the command 
<command> before the jump-subcommand <subcommand>, and the jump- 
subcommand <subcommand> is removed from the command <command>; 

b) If the jump-target command has some non-jump-subcommands of any 
types and some jump-subcommands of types LOOP*, ZEROJUMP and 
NONZERO_JUMP only, and SIZE(<command>)+SIZE(jump-target 
command)<=N, then all the subcommands of the jump-target command are 
inserted into the command <command> before the jump-subcommand 
<subcommand>, and the jump-subcommand <subcommand> is replaced with the 
subcommand JUMP <ADDR+1>, where the ADDR is the address of the jump- 
target command. 

[00161] For example, consider the following processor program: 

0: JUMP 2; 

1 : SET_CONST x 0; JUMP 4; 
2: SET_CONSTy0; 
3: JUMP 1; 
4: FIN; 

Suppose the command <command> is the command 3. The jump-subcommand 
<subcommand> is the subcommand JUMP L The jump-target command is the 
command 1, which has the JUMP subcommand. Since all the conditions of the case a) 
are met, the subcommands of the command 1 are inserted into the command 3 before the 
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subcommand JUMP 1, and this subcommand is removed. The following may be 
obtained: 

0: JUMP 2; 

1 : SET_CONST x 0; JUMP 4; 
2: SET_CONST y 0; 
3: SETCONST x 0; JUMP 4; 
4: FIN; 

The obtained program takes 4 clock cycles for execution, instead of 5 clock cycles 
required by the original program. 

[00162] Consider another example, where some small changes have been made to the 
jump-target command 1 in the above example, as follows: 

0: JUMP 2; 

1: SET_CONST x 0; ZEROJUMP z 4; 
2: SET_CONST y 0; 
3: JUMP 1; 
4: FIN; 

In this example, since all the conditions of the case b) are met, the subcommands of the 
command 1 are inserted into the command 3 before the subcommand JUMP 1, and this 
subcommand is replaced with a new JUMP (ADDR+1), where ADDR=1. Thus, the 
following may be obtained: 

0: JUMP 2; 

1 : SET CONST x 0; ZERO_JUMP z 4; 
2: SET_CONST y 0; 
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3: SETCONST x 0; ZEROJUMP z 4; JUMP 2; 
4: FIN; 

(5) Result 

[00163] FIG. 6 shows a result after the program shown in FIG. 2 is optimized in 
accordance with an exemplary embodiment of the method 300 shown in FIG. 3. Note 
that the processor program ORDER shown in FIG. 2 includes 38 commands and the 
optimized processor program shown in FIG. 6 includes 24 commands only. For the 
following input values of variables and arrays: dir=0, x[10]=(0,l,2,3,4,5,6,7,8,9), the 
processor program shown in FIG. 2 takes 1243 clock cycles for execution, but the 
optimized processor program shown in FIG. 6 only takes 682 clock cycles for execution. 

E. Advantages of present invention 

[00164] The present invention may have the following advantages. First, a simple 
processor language is developed so that the control device (processor) that executes the 
processor commands may be a very simple IC, which may be easy to create. At the same 
time, this processor language of the present invention allows several subcommands to be 
executed in parallel (for example, different arrays and variables may be read and written 
in parallel), which makes it possible to reach a high speed of program execution. 
Furthermore, a program written in C++ programming language may be translated into the 
processor language according to the present invention. Moreover, the program written in 
the processor language may be easily optimized according to the present invention in 
order to increase the performance. 

[00165] It is to be noted that the above described embodiments according to the present 
invention may be conveniently implemented using conventional general purpose digital 
computers programmed according to the teachings of the present specification, as will be 
apparent to those skilled in the computer art. Appropriate software coding may readily be 
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prepared by skilled programmers based on the teachings of the present disclosure, as will 
be apparent to those skilled in the software art. 

[00166] It is to be understood that the present invention may be conveniently implemented 
in forms of software package. Such a software package may be a computer program 
product which employs a storage medium including stored computer code which is used 
to program a computer to perform the disclosed function and process of the present 
invention. The storage medium may include, but is not limited to, any type of 
conventional floppy disks, optical disks, CD-ROMS, magneto-optical disks, ROMs, 
RAMs, EPROMs, EEPROMs, magnetic or optical cards, or any other suitable media for 
storing electronic instructions. 

[00167] It is understood that the specific order or hierarchy of steps in the processes 
disclosed is an example of exemplary approaches. Based upon design preferences, it is 
understood that the specific order or hierarchy of steps in the processes may be 
rearranged while remaining within the scope of the present invention. The accompanying 
method claims present elements of the various steps in a sample order, and are not meant 
to be limited to the specific order or hierarchy presented. 

[00168] It is believed that the present invention and many of its attendant advantages will 
be understood by the foregoing description. It is also believed that it will be apparent that 
various changes may be made in the form, construction and arrangement of the 
components thereof without departing from the scope and spirit of the invention or 
without sacrificing all of its material advantages. The form herein before described being 
merely an explanatory embodiment thereof, it is the intention of the following claims to 
encompass and include such changes. 
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